Adapting A Synonym Database To Specific Domains
نویسندگان
چکیده
This paper describes a method for adapting a general purpose synonym database, like WordNet, to a specific domain, where only a subset of the synonymy relations defined in the general database hold. The method adopts an eliminative approach, based on incrementally pruning the original database. The method is based on a preliminary manual pruning phase and an algorithm for automatically pruning the database. This method has been implemented and used for an Information Retrieval system in the aviation domain.
منابع مشابه
Adapting the OCMiner text processing system to the CTD controlled vocabulary
We adapted OCMiner, a modular text processing pipeline especially suited for high-speed processing of large document collections, to a specific controlled vocabulary as given by the Comparative Toxicogenomic Database (CTD). We provide a RESTful web service which processes documents given in the BioCreative XML format and annotates them with domainspecific terms from the CTD domains genes, chemi...
متن کاملDeveloping a Standardized Medical Speech Recognition Database for Reconstructive Hand Surgery
Fast and holistic access to the patients’ clinical record is a major requirement of modern medical decision support systems (DSS). While electronic health records (EHRs) have replaced the traditional paper-based records in most healthcare organization, the data entry into these systems remains largely manual. Speech recognition technology promises substitution of the more convenient speech-base...
متن کاملRanking and Selecting Synsets by Domain Relevance
The paper presents a novel method for domain specific sense assignment. The method determines the domain specific relevance of GermaNet synsets on the basis of the relevance of their constituent terms that cooccur within representative domain corpora. The approach is task independent and completely automatic. Experiments show results on three selected domains: business, soccer and medical.
متن کاملNavigation of Biomedical Literature
Data for the initial gene synonym dictionary was collected from publicly available databases, e.g. LocusLink (Pruitt et al., 2000), FlyBase, or UniProt (Apweiler et al., 2004). In addition to primary gene symbols and names, when known, this initial dictionary also contained alternative synonyms and orthographic variants. Although genome databases, were the principal sources of primary symbols a...
متن کاملCompatibility of B-Sheets with Epitopes Predicted by Immunoinformatic in Human IgG
Background & Aims: Antibodies, well-known as immunoglobulins (Igs), are produced by B lymphocytes and specifically defend against pathogens. Igs are glycoproteins and have high diagnostic value in several diseases including infections (1). Igs are composed of light and heavy chains (2, 3). Each chain is comprised of about 110-120 amino acid residues which create immunoglobulin folds named domai...
متن کامل